Interface Focus — Latest Matching Preprints

1

Pre-infusion exhaled breath volatile organic compounds predict severe CRS and ICANS after CAR T-cell therapy

Berna, A.; Fahrmann, J.; Irajizad, E.; Rudsari, H.; Liu, Y.; Logan, J.; Murtada, K.; Grandy, J.; Edwards, M.; Ayers, A.; Ahmed, S.; Neelapu, S.; Saini, N.; John, A.; John, T.

2026-06-01 oncology 10.64898/2026.05.28.26354352 medRxiv

Top 1.0%

0.3%

Background: Severe cytokine release syndrome (CRS) and immune effector cell-associated neurotoxicity syndrome (ICANS) are major dose-limiting toxicities of chimeric antigen receptor (CAR) T-cell therapy. Existing pre-infusion biomarkers offer modest discrimination, motivating non-invasive alternatives. Methods: We prospectively enrolled 26 patients with relapsed/refractory large B-cell lymphoma receiving axicabtagene ciloleucel. Pre-infusion (day -1) exhaled breath samples were analyzed by gas chromatography-mass spectrometry for 40 volatile organic compounds (VOCs). Candidates with univariate AUC > 0.65 for severe (grade >=2) CRS or ICANS were carried forward to sensitivity-maximization-at-given-specificity with LASSO regularization (SMAGS-LASSO), which selected separate panels for each outcome. Model performance was assessed by leave-one-out cross-validation with permutation p-values and Harrell bootstrap optimism correction. Results: The 4-VOC CRS panel (heptanal, benzaldehyde, 2-butanone, ethylbenzene) achieved LOOCV AUC 82.5% (80% sensitivity at 88% specificity) and the 3-VOC ICANS panel (nonanal, allyl methyl sulfide, levomenthol) achieved AUC 86.3% (67% sensitivity at 86% specificity). By tertile, severe CRS occurred in 8/9 (89%) high-risk versus 2/9 (22%) low-risk patients (Cox HR 6.82, 95% CI 1.41-32.9, p=0.017) and severe ICANS occurred in 8/9 (89%) versus 2/9 (22%) (HR 8.28, 95% CI 1.73-39.6, p=0.008). Each 1-SD score increase corresponded to a 3.80-fold higher hazard of severe CRS (p<0.001) and 4.36-fold higher hazard of severe ICANS (p<0.001). In head-to-head comparison, the 3-VOC ICANS panel outperformed the modified Endothelial Activation and Stress Index (mEASIX) (delta-AUC +0.36, DeLong 1-sided p=0.008). The 4-VOC CRS panel had numerically higher AUC than mEASIX (delta-AUC +0.19, p=0.150). 
Conclusions: Pre-infusion exhaled breath VOC panels stratify CAR T-cell recipients by severity and timing of severe CRS and ICANS, providing a non-invasive complement to existing serum biomarkers. Multi-institutional validation is warranted.

2

A priority index-based computational medicine framework (PimRNA) for prioritising personalised mRNA cancer vaccines

Fang, H.; Tan, T.

2026-05-29 oncology 10.64898/2026.05.26.26354114 medRxiv

Top 1%

0.2%

Background: The development of personalised mRNA cancer vaccines holds considerable promise for oncology, yet a significant translational gap persists between neoantigen identification and the selection of therapeutically impactful targets. Current approaches predominantly prioritise human leukocyte antigen (HLA) binding affinity and immunogenicity, often overlooking the systems-level biological context of the target. This can inadvertently favour immunogenic but biologically peripheral peptides that exert limited influence on tumour signalling networks, thereby constraining vaccine efficacy. Furthermore, mRNA therapeutics must satisfy additional design requirements, including favourable codon usage and favourable secondary-structure stability, which directly affect in vivo translation and half-life. A unified computational framework that integrates neoantigen discovery with network biology is therefore critically needed. Results: Here, we present PimRNA, a Priority index (Pi)-centric computational medicine framework that bridges this gap by unifying neoantigen identification, mRNA sequence optimisation, and gene interaction network analysis. First, high-confidence tumour-specific HLA class I and II neoantigenic peptides are identified from paired tumour-normal genomic and tumour transcriptomic data using NeoDisc. Second, the coding sequences of these peptides are optimised for stability and translational efficiency with LinearDesign, yielding a core set of neoantigen-encoding mRNAs. Third, a random walk with restart algorithm is applied to a knowledgebase of gene interactions to identify peripheral genes exhibiting significant network connectivity to core genes, generating a gene-predictor matrix in which each gene is assigned an affinity score reflecting its network proximity to immunogenic neoantigens. 
These scores are consolidated into a single, unified priority rating (0-5) for each gene, followed by subnetwork analysis that reveals therapeutically relevant gene modules. Application of PimRNA to breast cancer and melanoma datasets demonstrates that it successfully selects high-confidence immunogenic neoantigen candidates embedded within biologically meaningful tumour-specific networks. Conclusion: PimRNA provides a systems biology foundation for mRNA vaccine design, moving beyond isolated immunogenicity to prioritise targets that are both highly presented and central to tumour-relevant biological networks. This framework offers a generalisable strategy for the rational discovery and prioritisation of mRNA therapeutics, significantly advancing the field of computational medicine towards personalised cancer vaccines.
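
The random walk with restart step described above can be illustrated with a minimal sketch. It is not the PimRNA implementation: the toy graph, the gene names (`CORE`, `G1`, `G2`, `G3`), the restart probability of 0.5, and the function name are all hypothetical, and the sketch assumes an undirected network in which every gene has at least one neighbour.

```python
def random_walk_with_restart(adj, seeds, restart=0.5, tol=1e-10, max_iter=1000):
    """Return steady-state visiting probabilities for each node.

    adj     : dict mapping node -> list of neighbours (assumed undirected,
              every node has at least one neighbour)
    seeds   : set of seed ("core") nodes the walker restarts from
    restart : probability of jumping back to a seed at each step (assumed value)
    """
    nodes = sorted(adj)
    # Restart distribution: uniform over the seed nodes.
    p0 = {n: (1.0 / len(seeds) if n in seeds else 0.0) for n in nodes}
    p = dict(p0)
    for _ in range(max_iter):
        # Start each update with the restart mass.
        nxt = {n: restart * p0[n] for n in nodes}
        # Spread the remaining mass evenly along each node's edges.
        for n in nodes:
            share = (1.0 - restart) * p[n] / len(adj[n])
            for m in adj[n]:
                nxt[m] += share
        delta = sum(abs(nxt[n] - p[n]) for n in nodes)
        p = nxt
        if delta < tol:
            break
    return p


# Hypothetical chain network: CORE - G1 - G2 - G3
toy_network = {
    "CORE": ["G1"],
    "G1": ["CORE", "G2"],
    "G2": ["G1", "G3"],
    "G3": ["G2"],
}
affinity = random_walk_with_restart(toy_network, seeds={"CORE"})
```

In this toy chain the affinity score falls with distance from the seed gene, which is the behaviour the abstract relies on when it treats the score as a measure of network proximity.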

3

Survival and neurologic outcomes after re-irradiation in children with diffuse midline glioma and diffuse intrinsic pontine glioma

Vaziri, T.; Vyas, D.; Alhumaid, M.; Lucas, C.-H.; Guryildirim, M.; Kilburn, L.; Gartrell, R. D.; Koldobskiy, M. A.; Raabe, E.; Cohen, K.; Ladra, M.; Acharya, S.

2026-06-01 oncology 10.64898/2026.05.29.26354429 medRxiv

Top 2%

0.2%

Background: Reirradiation (reRT) is increasingly offered following progression in diffuse intrinsic pontine glioma (DIPG) and diffuse midline glioma (DMG), though optimal patient selection remains a challenge. This study evaluated clinical outcomes after reRT in a contemporary cohort of patients with DIPG/DMG. Methods: Patients <26 years old with DMG/DIPG treated with radiation therapy between 2011 and 2025 were retrospectively reviewed. Primary endpoints included overall survival (OS2) and progression-free survival (PFS2), measured from first progression, and change in neurologic symptoms after reRT. Survival was estimated using Kaplan-Meier methods, with Cox proportional hazards modeling for prognostic factors. Results: Fifty-eight patients were included; 37 (63.8%) underwent reRT. Tumors were predominantly pontine (74.1%). ReRT was associated with improvement in motor function (51.4% vs. 9.5%, p=0.002), cranial nerve function (29.7% vs. 4.8%, p=0.044), and gait ataxia (35.1% vs. 9.5%, p=0.059). Median OS2 and PFS2 were longer with reRT (OS2: 9.67 vs. 2.57 months, p<0.001; PFS2: 5.63 vs. 1.57 months, p<0.001). OS2 was independently associated with reRT (HR 0.27, p<0.0001), pontine location (HR 2.94, p=0.004), and steroid use at progression (HR 4.12, p=0.001). PFS2 was independently associated with reRT (HR 0.23, p<0.0001) and distant pattern of failure (HR 2.83, p=0.037). Among reRT patients, non-pontine location was associated with improved OS2 (p=0.02), and local failure was associated with improved PFS2 (p=0.003). Conclusion: ReRT was associated with neurologic improvement and prolonged survival. Patients with non-pontine tumors or local-only failure might derive the greatest benefit. Prospective studies are warranted to define optimal dose/fractionation and refine patient selection.

4

Development and Validation of a Machine Learning Model to Predict Prognosis in Patients with Advanced Head and Neck Cancer

Zhang, K.; Gao, L.; John, D.; Li, W. T.; Hogarth, M.; Coffey, C. S.; Ongkeko, W. M.

2026-05-28 oncology 10.64898/2026.05.27.26354194 medRxiv

Top 2%

0.1%

Importance Prognostic tools beyond staging are needed to guide treatment and counseling in head and neck squamous cell carcinoma (HNSCC). Objective To develop and externally validate a machine learning model predicting survival in advanced HNSCC using routinely collected clinical and biomarker data. Design, Setting, and Participants Retrospective, multi-institutional cohort study including 2,385 patients with stage III-IV HNSCC diagnosed from 2012-2022 in the University of California Health Data Warehouse (UCHDW). Patients were randomly split into training (n = 1,908) and test (n = 477) sets. Partial external validation used 7,749 patients from the Surveillance, Epidemiology, and End Results (SEER) registry (2010-2020). Exposures Demographic, tumor, treatment, comorbidity, and biomarker variables recorded at or before diagnosis. Main Outcomes and Measures The primary outcome was all-cause mortality within 70 months. Cox proportional hazards models included all predictors. Discrimination was assessed with Harrell's concordance index (C-index), calibration with predicted vs observed survival, and stratification with Kaplan-Meier curves. A Random Survival Forest (RSF) was trained for benchmarking and interpretability using Shapley Additive exPlanations (SHAP). Results Among 2,385 patients in UCHDW (median age, 63 years; 29.0% mortality), the Cox model achieved a C-index of 0.735 in the internal test set. Risk quartiles showed clear separation on Kaplan-Meier curves (log-rank p < 0.0001). In the SEER cohort (n = 7,749), where only demographic, staging, subsite, and treatment variables were available, the reduced Cox model achieved a C-index of 0.688, with calibration showing modest underestimation of survival in high-risk groups. Age, T stage, Charlson Comorbidity Index, neutrophil-to-lymphocyte ratio, and platelet count were among the strongest predictors, while surgery was associated with improved survival. 
The RSF achieved a C-index of 0.758 internally, with SHAP highlighting nonlinear effects of albumin, BMI, and inflammatory markers. Conclusions and Relevance A machine learning model using routine clinical and biomarker data demonstrated good prognostic performance in advanced HNSCC, with partial external validation. Such approaches may support individualized survival estimates, risk stratification, and treatment discussions, but broader validation is required before clinical adoption.
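
For readers unfamiliar with Harrell's concordance index, the following is a minimal sketch of how it can be computed from follow-up times, event indicators, and predicted risk scores. The five-patient dataset and the function name are hypothetical and are not taken from the study; ties in predicted risk are counted as half-concordant, which is one common convention.

```python
def harrell_c_index(times, events, risk):
    """Harrell's C: share of comparable pairs ranked correctly by risk.

    times  : follow-up time for each patient
    events : 1 if death was observed, 0 if censored
    risk   : predicted risk score (higher means worse expected survival)
    """
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        for j in range(n):
            # A pair is comparable when patient i had an observed event
            # strictly before patient j's last follow-up time.
            if events[i] and times[i] < times[j]:
                comparable += 1
                if risk[i] > risk[j]:
                    concordant += 1.0   # earlier death had the higher risk score
                elif risk[i] == risk[j]:
                    concordant += 0.5   # tied scores count as half
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# Hypothetical toy cohort (months of follow-up, event flag, model risk score)
toy_times = [5, 10, 20, 30, 40]
toy_events = [1, 1, 0, 1, 0]
toy_risk = [0.9, 0.4, 0.6, 0.3, 0.1]
toy_c = harrell_c_index(toy_times, toy_events, toy_risk)
```

Here 7 of the 8 comparable pairs are ordered correctly, so the toy C-index is 0.875. A value of 0.5 corresponds to random ranking and 1.0 to perfect ranking.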

5

Cancer Prevalence and Patterns in Kilifi County: A 10-year Retrospective Descriptive Study

Masha, M.; Mbugua, R. W.; Abdullahi, M.; Sheikh, N. A.; Omar, A.; Abdihamid, O.

2026-06-01 oncology 10.64898/2026.05.20.26353643 medRxiv

Top 3%

0.1%

Background: Cancer is an increasing public health challenge in Kenya, particularly in rural and underserved regions where surveillance systems and diagnostic capacity remain limited. Kilifi County, located along the Kenyan coast, lacks a population-based cancer registry, and data on the local cancer burden are not available. This study aimed to characterize the demographic distribution of patients, the cancer burden in the county, and the management of cancer cases diagnosed at Kilifi County Referral Hospital (KCRH) over ten years. Methods: This retrospective study analyzed the patterns of cancer in Kilifi County using patient records from KCRH during the study period (January 1, 2014, to January 1, 2024). Results: A total of 101 patients with cancer were identified, 58% female, with a mean age of 54 years. Most patients were from Kilifi North (47%), with a high proportion reporting no formal occupation (41%) or farming (26%). Esophageal and cervical cancers were the most common (18% each), followed by breast and prostate cancers (5% each), with other malignancies occurring infrequently. Histopathology was the primary diagnostic modality (88%). Staging data were incomplete in 70% of cases; among documented cases, the majority presented with advanced disease (21% stage IV). Due to limited local treatment capacity, approximately half of the patients were referred to tertiary centers for chemotherapy, radiotherapy, or surgery. At data cut-off, 43% had died, 25% were on treatment, and 29% were lost to follow-up, with only 2% completing treatment or under follow-up. Conclusions: This study demonstrates a substantial cancer burden in Kilifi County and highlights critical gaps in diagnostic capacity, staging, and continuity of care. Strengthening cancer surveillance systems, expanding diagnostic and treatment infrastructure, and establishing a population-based cancer registry are essential to improving cancer outcomes and advancing equitable care in rural Kenya.

6

Redefining Extent Of Resection After Meningioma Surgery: a Multicentre Observational Machine Learning Analysis Comparing Simpson, Radiological and Volumetric Grading

Pandit, A. S.; Deehan, M.; Moudgil-Joshi, J.; Reischer, G.; Mathew, S.; Pace, G.; Fatania, G.; Dalton, A.; Nair, R.; Hyare, H.; Mallon, D.; Kitchen, N.; Marcus, H. J.; Nachev, P.

2026-05-27 oncology 10.64898/2026.05.23.26353944 medRxiv

Top 3%

0.1%

Background: Extent of resection remains central to meningioma management, yet Simpson grading is subjective and may not reflect measurable postoperative residual disease. We compared surgeon-reported Simpson grade, report-derived radiological grading, and residual tumour volumetry across a multicentre cohort. Methods: We performed a retrospective study across two tertiary neurosciences centres comprising four hospitals, including patients undergoing primary cranial meningioma resection from 2006 to 2025. Postoperative magnetic resonance imaging (MRI) reports were harmonised using weakly supervised natural language processing based on term frequency-inverse document frequency (TF-IDF) and a linear support vector machine classifier. Residual tumour volume was segmented from contrast-enhanced postoperative MRI and log-transformed. Concordance between Simpson and radiological gross-total/subtotal resection classification was assessed using absolute agreement and prevalence-adjusted bias-adjusted kappa (PABAK). Cox models assessed recurrence-free survival, with bootstrap validation and anatomical and scan-timing sensitivity analyses. Results: Among 912 patients, recurrence or residual progression occurred in 281. Surgical-radiological agreement was substantial but imperfect (absolute agreement 74%; PABAK 0.61), with lower agreement in skull-base and parafalcine-parasagittal tumours. In adjusted models, recurrence hazard increased with Simpson grade (hazard ratio 1.54, 95% confidence interval 1.37-1.72), radiological grade (1.92, 1.68-2.20), and log-transformed residual volume (1.20, 1.16-1.24; all p<0.0005). Optimism corrected concordance increased from Simpson grade to radiological grade and log-volumetry (0.692, 0.733, and 0.748), with this ranking preserved across sensitivity analyses. Conclusions: Imaging-based postoperative residual disease measures outperformed Simpson grade. 
TF-IDF-assisted report-derived grading provides a scalable bridge to volumetry, while quantitative residual volume offers the strongest prognostic representation.

7

DISCERN: A Clinical Impact-aware Framework for Radiology Report Comparison

Sharma, R.; Beeche, C.; Dong, J.; Zhuang, R.; Qu, H.; Zhang, R.; Gangaram, V.; Goswami, P.; Xin, J.; Ballard, J.; Goldberg, A.; Sagreiya, H.; Long, Q.; Chen, T.; Witschey, W. R.

2026-05-27 radiology and imaging 10.64898/2026.05.26.26353612 medRxiv

Top 3%

0.1%

The surge in medical imaging has spurred the development of vision-language models (VLMs) to alleviate radiologist workloads. However, clinical deployment is hindered by the lack of meaningful evaluation frameworks. Current metrics - ranging from semantic similarity to large language model (LLM) based judges - often fail to distinguish between clinically trivial and critical discrepancies, poorly reflecting real-world clinical judgment. To address this, we introduce DISCERN (Discordance and Significance-aware Entity-level Radiology Report Comparison). DISCERN is a significance-aware framework that weighs report errors based on their potential impact on patient care. Our results demonstrate that DISCERN powered by closed source LLMs aligns more closely with expert radiologist assessments than traditional metrics or current LLM evaluators, providing a more interpretable and clinically relevant benchmark. By modeling radiologist prioritization and entity-level feedback, DISCERN facilitates targeted model refinement and ensures the safer integration of generative AI into clinical workflows.

8

Cleaner Air for Lower Cardiometabolic Risk: protocol for a double-blind, randomized, sham-controlled trial of HEPA filtration in adults with prediabetes.

Wittkopp, S.; Asachi, P.; Kazatsker, F.; Aleman, J. O.; Gordon, T.; Brook, R.; Thorpe, L.; Newman, J. D.

2026-06-01 endocrinology 10.64898/2026.05.29.26354420 medRxiv

Top 3%

0.1%

Introduction: Air pollution is a leading driver of cardiovascular disease, with a growing body of literature implicating it in worsened glucose homeostasis. Increases in fine particulate matter air pollution (PM2.5) are associated with increased blood glucose and hemoglobin A1c across the glycemic spectrum from normoglycemia to prediabetes to all forms of diabetes. Despite strong evidence for positive associations of PM2.5 with dysglycemia, it remains unknown whether reducing air pollution exposure through air filtration can effect improvements in glucose. This study aims to test the hypothesis that short-term, in-home air pollution reduction using high efficiency particulate air (HEPA) filtration will improve blood sugar in adults with prediabetes. Methods and analysis: This is a randomized, double-blind, sham-controlled trial of the effects of lowering air pollution exposure using HEPA filtration on cardiometabolic health in adults with prediabetes living in the New York City area. Participants will be randomly assigned to use bedroom air cleaners or sham air cleaners, with PM2.5 measured continuously for 1 month. The primary outcomes will be continuous glucose monitoring metrics measured before and after HEPA air filtration. Exploratory outcomes will include insulin resistance measures, serum biomarkers, and transcriptomics measured before and after HEPA intervention. We will quantify effects of HEPA filtration with models using treatment arm (true versus sham filtration) as the independent variable. Secondary analyses will model continuous measures of PM2.5 as the independent variable. Ethics and Dissemination: This study has undergone peer review, and the work was supported by Grant 2023-0214 from the Doris Duke Foundation, which had no other role in study design or implementation. The study was registered in ClinicalTrials.gov (NCT05994937) prior to recruitment. Clinical Trials: NCT05994937; https://clinicaltrials.gov/study/NCT05994937

9

Sensitive Glioma Detection and Recurrence Monitoring Using a Machine Learning Model Based on Circulating Monocytes

Wu, W.; Chai, R.; Xia, P.; Wu, L.; Yu, B.; Chen, X.; Pang, B.; Chen, D.; Wang, Y.; Wang, N.; Li, X.; Liu, H.; Deng, Q.; Wan, F.; Lyu, F.; Wang, L.; Zhang, W.; Zhang, J.; Jiang, T.; Wang, Q.

2026-06-01 oncology 10.64898/2026.05.29.26354409 medRxiv

Top 4%

0.1%

Background: Non-invasive diagnosis and reliable recurrence surveillance remain critical unmet needs in gliomas. Glioma induces profound systemic immune alterations despite its anatomical confinement to the central nervous system. Circulating immune cells, particularly monocytes, are key mediators of tumor-host crosstalk and may retain tumor-induced transcriptional imprints. However, their potential clinical utility as blood-based biomarkers for detection and monitoring remains largely unexplored. Methods and findings: In this study, we performed integrated single-cell RNA sequencing of blood immune cells and demonstrated that circulating CD14+ monocytes are significantly expanded in glioma patients, exhibiting features of differentiation arrest and increased transcriptional plasticity. These cells harbor glioma-specific molecular signatures distinct from those observed in healthy controls and patients with other tumors. Leveraging these findings, we developed an ensemble machine learning diagnostic model based on transcriptomic profiles of circulating CD14+ monocytes (training cohort, n=107), which achieved a mean area under the receiver operating characteristic curve (AUC) of 0.971 during cross-validation. In an independent cohort of 567 participants, the model maintained high diagnostic accuracy, yielding an AUC of 0.877 for distinguishing glioma from controls and other tumors. It also achieved a recurrence detection AUC of 0.969 in 51 postoperative samples. Moreover, in a prospective follow-up study involving 30 glioma patients, lower postoperative model-derived scores were significantly associated with prolonged progression-free survival (log-rank test, P=0.043), supporting its prognostic utility. Conclusion: We demonstrate that circulating CD14+ monocytes undergo glioma-specific transcriptional reprogramming, generating a systemic tumor-associated signal that can be captured via transcriptomic profiling. 
This blood-based diagnostic model provides a non-invasive, scalable approach for glioma detection, recurrence surveillance, and outcome prediction.

10

TopBrain Segmentation Challenge for Whole Brain Vessel Anatomy

Yang, K.; Shi, P.; Huang, H.; Musio, F.; Baazaoui, H.; Aydin, O. U.; Hilbert, A.; Hamadache, R. E.; Yalcin, C.; Zhang, M.; Falcetta, D.; de la Rosa, E.; Shit, S.; Prabhakar, C.; Wittmann, B.; Rokuss, M. R.; Kirchhoff, Y.; Al-Maskari, R.; Hoeher, L.; Juchler, N.; Casamitjana, A.; Cleary, J.; Schmick, A.; Baumgartner, P.; Deseoe, J.; Vandans, O.; Lee, D.; Oh, K.; LaBella, D.; Mazher, M.; Niederer, S. A.; Qayyum, A.; Liu, Y.; Chen, J.; Kim, W.; Asawalertsak, N.; Kim, M.; Shin, D.; Park, S.-H.; Kikuchi, S.; Zhang, Y.; Liu, J.; Cui, Y.; Qiu, Y.; Verschuur, A.; Zhang, J.; van der Schaaf, I.; Su, R.;

2026-05-30 radiology and imaging 10.64898/2026.05.28.26354312 medRxiv

Top 4%

0.1%

We present the TopBrain 2025 Challenge, the first benchmark for fine-grained multiclass segmentation of the whole brain vasculature in both computed tomography angiography (CTA) and magnetic resonance angiography (MRA). Building on the TopCoW challenge, TopBrain scales vessel annotation from the Circle of Willis to the entire brain, introducing a dataset of 90 annotated volumes across 48 landmark vessel classes spanning arterial and venous systems, of which 50 training volumes are publicly released. Vessel definitions were consolidated from established neuroanatomical references into a unified annotation scheme, and vessel caliber measurements along the centerline are reported for the first time across the whole brain vascular anatomy. To address the unique challenges of multiclass brain vessel segmentation, we propose an evaluation framework that accounts for detection in segmentation performance, assesses anatomical plausibility, and introduces novel contamination metrics that characterize inter-class prediction errors. Fifteen teams from over 220 registered participants submitted algorithms to the benchmark. The top-performing teams built on nnUNet with principled system design choices, achieving around 80% Dice scores, near-zero invalid neighbor counts, over 60% F1 scores for side-road vessels, and below 18% foreground contamination ratio. Larger vessels are easier to segment, while smaller and more complex vessels remain the true bottleneck. The annotated datasets and podium-finish algorithms are made publicly available on Zenodo.
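
As a reminder of what the reported Dice scores measure, the sketch below computes a per-class Dice coefficient for a multiclass label map. It is not the challenge's evaluation code: the flattened eight-voxel volumes, the label values, and the choice to skip classes absent from both prediction and reference are illustrative assumptions.

```python
def dice_per_class(pred, truth, labels):
    """Per-class Dice = 2 * |P ∩ T| / (|P| + |T|) on flattened label maps.

    pred, truth : equal-length sequences of integer class labels per voxel
    labels      : foreground class labels to score
    Classes absent from both maps are skipped (an assumption of this sketch).
    """
    if len(pred) != len(truth):
        raise ValueError("prediction and reference must have the same size")
    scores = {}
    for lab in labels:
        n_pred = sum(1 for v in pred if v == lab)
        n_true = sum(1 for v in truth if v == lab)
        overlap = sum(1 for a, b in zip(pred, truth) if a == lab and b == lab)
        if n_pred + n_true == 0:
            continue
        scores[lab] = 2.0 * overlap / (n_pred + n_true)
    return scores


# Hypothetical flattened volumes: 0 = background, 1 and 2 = two vessel classes
toy_truth = [0, 1, 1, 1, 2, 2, 0, 0]
toy_pred = [0, 1, 1, 2, 2, 2, 0, 1]
toy_dice = dice_per_class(toy_pred, toy_truth, labels=[1, 2])
```

In this toy case class 1 scores 2/3 and class 2 scores 0.8. The voxel that was labelled as class 2 but belongs to class 1 is the kind of inter-class error that the abstract's contamination metrics are meant to characterise separately.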

11

Field-ready portable rapid nucleic acid test for tuberculosis detection and drug-resistance profiling in resource-limited settings

Nag, S.; Banerjee, S.; Banerjee, S.; Ghosh, S.; Bera, A.; Shanmugam, S.; Mondal, A.; Chakraborty, S.

2026-06-01 infectious diseases 10.64898/2026.05.29.26354438 medRxiv

Top 4%

0.1%

Tuberculosis (TB) remains one of the deadliest infectious diseases, with over a million deaths annually and a growing threat from multidrug-resistant strains (MDR-TB). A major bottleneck in controlling TB is the lack of truly portable, rapid, and user-friendly diagnostic systems that can operate effectively in decentralized, resource-constrained settings. Here, we present a first-of-its-kind, portable nucleic-acid-based diagnostic platform that enables both primary TB screening and detection of drug resistance within the same unified framework, without any change in the operative embodiment. The system integrates loop-mediated isothermal amplification (LAMP) targeting dual Mycobacterium tuberculosis markers (IS6110 and IS1081) with a compact, AI-enabled device and smartphone-based readout, delivering rapid and reliable results at the point-of-care. Clinical evaluation across 105 samples demonstrated high sensitivity and specificity. Further validation through real-world deployment in a primary healthcare setting, using a single-gene (IS6110) configuration operated by minimally trained personnel, yielded 95.60% sensitivity and 100% specificity, benchmarked against GeneXpert. Critically, the same platform architecture, without modification, extends seamlessly to drug-resistance profiling, demonstrated here through a probe-free, allele-specific LAMP approach for identifying key mutations associated with rifampicin (rpoB) and isoniazid (katG) resistance. By combining robust molecular diagnostics with AI-driven automation in a compact and accessible format, this work represents a significant medical advancement toward democratizing TB care. The platform thus holds strong potential to enable early screening, guide timely treatment decisions, reduce transmission, and substantially strengthen global TB elimination efforts, particularly in high-burden, low-resource settings.

12

Cancer Medicine Prices, Availability, and Affordability in Kisumu County, Kenya

Oketch, J. O.; Amolo, S. A.; Onguru, D. O.

2026-05-28 oncology 10.64898/2026.05.27.26354206 medRxiv

Top 4%

0.1%

Background: The rising prices of cancer medicines have intensified concerns about treatment access and health system sustainability, particularly in low- and middle-income settings. Systematic facility-level evidence on which medicines are actually available, at what prices, and at what cost to patients remains scarce, constraining evidence-based policy reform. Methods: Using an adapted WHO/Health Action International methodology, we conducted a cross-sectional survey of 52 cancer medicines across five therapeutic classes at five health facilities in Kisumu County, Kenya. Availability was measured as the proportion of facilities stocking each medicine. Affordability was assessed using the days' wages required for the lowest-paid government worker to purchase standard treatment regimens, calculated per chemotherapy cycle and for the maximum possible number of cycles. Results: Overall medicine availability was 48.1%, with marked inter-facility variation. Affordability analysis revealed severe financial barriers. The breast cancer AC regimen required 19.6-47.4 days' wages per full course; cervical cancer cisplatin, 19.8-49.2 days' wages; colorectal FOLFOX, 80.0-303.6 days' wages; and prostate docetaxel reached 437 days' wages at the highest-cost facility. The Social Health Authority's (SHA) KES 550,000 annual ceiling adequately covered cytotoxic regimens for common cancers at competitive prices but was exceeded by 24-116% for HER2-positive breast cancer requiring trastuzumab, with further strain for recurrent cervical and metastatic prostate cancers. Conclusions: Cancer medicines in Kisumu County are inconsistently available and highly variable in price, resulting in inequitable access. We call for urgent retail price markup regulation, expanded pooled procurement through KEMSA, inclusion of priority targeted therapies on the Kenya Essential Medicines List, and SHA benefit packages redesigned around full-course regimen costs.
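
The two affordability measures in this abstract reduce to simple arithmetic, sketched below. Only the KES 550,000 SHA ceiling comes from the abstract. The daily wage, the course cost, and the annual cost used in the example are hypothetical placeholders, not survey data, and the function names are illustrative.

```python
SHA_ANNUAL_CEILING_KES = 550_000  # annual ceiling stated in the abstract


def days_wages(course_cost_kes, daily_wage_kes):
    """Days of work needed to pay for a treatment course at a given daily wage."""
    if daily_wage_kes <= 0:
        raise ValueError("daily wage must be positive")
    return course_cost_kes / daily_wage_kes


def percent_over_ceiling(annual_cost_kes, ceiling_kes=SHA_ANNUAL_CEILING_KES):
    """Percentage by which an annual treatment cost exceeds the ceiling (0 if within it)."""
    return max(0.0, (annual_cost_kes - ceiling_kes) / ceiling_kes * 100.0)


# Hypothetical inputs for illustration only
example_days = days_wages(course_cost_kes=30_000, daily_wage_kes=750)
example_excess = percent_over_ceiling(682_000)
```

With these placeholder values the course costs 40 days' wages, and an annual cost of KES 682,000 sits about 24% above the ceiling, which is the lower end of the range the abstract reports for trastuzumab-based treatment.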

13

Multi-Agent AI for Chest Radiography: A Sequential Segmentation and LLM-Driven Consultative Tool for Medical Training

Kurt, F.; Subasi, A.

2026-06-01 health informatics 10.64898/2026.05.29.26354432 medRxiv

Top 4%

0.1%

Background: Traditional diagnostic models lack explainability, while multimodal language models prone to hallucination remain unsafe for medical education. An interactive, risk-free artificial intelligence framework is required to serve as a reliable clinical mentor for radiology trainees. Methods: We propose a multi-agent architecture decoupling deterministic image analysis from generative consultation. Specialized computer vision models perform anatomical localization and pathological segmentation. These quantitative outputs are synthesized into a structured payload, which grounds a locally hosted large language model (LLaVA 7B) using strict prompt guardrails and prerequisite protocols. Results: The system effectively eliminates visual hallucinations by intercepting unanchored queries. The artificial intelligence tutor successfully contextualizes spatial anomalies and baseline metrics, generating accurate conversational explanations and formally structured radiology reports while strictly enforcing medical safety disclaimers. Discussion and Conclusion: By anchoring language generation exclusively to verified algorithmic realities, this framework transforms opaque diagnostic models into safe, interactive educational simulators. This establishes a highly reliable paradigm for integrating explainable artificial intelligence into medical training.

14

Nationwide Trends and Outcomes in Major Gastrointestinal Cancer Surgery

Espinoza, R. E. D. A.; Bastos, L. S. L.; Hamacher, S.; Salluh, J. I. F.; Bozza, F. A.

2026-05-27 oncology 10.64898/2026.05.26.26354087 medRxiv

Top 4%

0.1%

Background: Complex gastrointestinal (GI) oncologic surgeries carry substantial perioperative risk, and nationwide outcomes in low- and middle-income countries (LMICs) are underreported. This study aimed to evaluate national trends in surgical volume, in-hospital mortality, and intensive care unit (ICU) utilization for major GI cancer surgery in Brazil's Unified Health System (SUS) over a 14-year period. Methods: A population-based analysis was performed using national administrative databases to identify all adult patients undergoing colectomy, gastrectomy, pancreatic resection, or esophagectomy for cancer in the SUS from 2010 to 2023. Annual rates were age-standardized according to the WHO standard population. Temporal trends were assessed using Poisson regression to estimate average annual percent change (AAPC) with 95% confidence intervals (CIs). Results: A total of 179,337 hospital admissions were analyzed (median age 63 years; 48% female). Colectomies accounted for 72% of cases, followed by gastrectomies (19%), pancreatic resections (5%), and esophagectomies (3%). Although crude surgical volume increased, population-adjusted rates declined overall (AAPC -2.09%; 95% CI -2.58 to -1.59), mainly due to reductions in gastrectomies and esophagectomies. Median hospital stay decreased from 9 to 7 days (AAPC -1.93%; 95% CI -2.79 to -1.06). Overall in-hospital mortality declined from 8.1% to 5.7% (AAPC -2.88%; 95% CI -4.15 to -1.59). ICU utilization rose from 37% to 43% of admissions (AAPC +1.31%; 95% CI 0.91 to 1.71). Conclusion: Over 14 years, in-hospital mortality and length of stay for major gastrointestinal cancer surgery declined within Brazil's universal public health system. These temporal trends occurred alongside expansion of accredited oncology services and increased ICU utilization, although causal relationships cannot be established from administrative data. 
These findings should be interpreted as hypothesis-generating and highlight the need for more granular hospital-level data in LMIC settings.
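
To show how an annual percent change relates to a log-linear trend, the sketch below converts a fitted slope into a percentage. It is a simplification, not the study's method: it uses ordinary least squares on log rates in place of the Poisson regression named in the abstract, it assumes a single trend segment (so the average annual percent change equals the annual percent change), and the rate series is synthetic.

```python
import math


def aapc_from_rates(years, rates):
    """Percent change per year from an OLS fit of log(rate) on calendar year.

    With slope b on the log scale, the annual percent change is (exp(b) - 1) * 100.
    """
    n = len(years)
    logs = [math.log(r) for r in rates]
    mean_x = sum(years) / n
    mean_y = sum(logs) / n
    sxx = sum((x - mean_x) ** 2 for x in years)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(years, logs))
    slope = sxy / sxx
    return (math.exp(slope) - 1.0) * 100.0


# Synthetic series: a rate that falls by exactly 3% per year from 2010 to 2023
toy_years = list(range(2010, 2024))
toy_rates = [8.0 * 0.97 ** (y - 2010) for y in toy_years]
toy_aapc = aapc_from_rates(toy_years, toy_rates)
```

Because the synthetic series declines by a constant 3% per year, the recovered value is approximately -3.0. Real administrative counts would call for a count model with a population offset and interval estimates, as the abstract describes.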

15

Keeping human in the loop: A three-phase generative AI workflow for research integrity in data-intensive science. A methodological case study using elite Ethiopian distance-running data

Galko, P.; Yisamaw, A.; Haugen, T.; Seiler, S.

2026-05-29 sports medicine 10.64898/2026.05.29.26354013 medRxiv

Top 5%

0.0%


Background: Generative AI tools can support data-intensive research by writing code, drafting prose, searching analytical possibilities, and stress-testing claims. They can also produce false citations, drift between statistical specifications, and lose continuity across long investigations. This paper describes a practical workflow for using AI systems in empirical research while keeping discovery, verification, and accountability inspectable. Methods: We developed and applied a three-phase human-AI workflow to a case study of 14 elite Ethiopian distance runners. The dataset contained 22,605 GPS-segments collected across 97 consecutive days in late 2025, supplemented by venue and athlete metadata collected in the field. Phase 1 used an autonomous data-exploration tool to pre-filter the hypothesis space across five seeded research questions. Phase 2 used an AI system under direct human guidance to construct candidate findings into numerical claims, verification scripts, and draft text. Phase 3 used an independent AI system in an adversarial role to stress-test methods, statistics, prose, figures, and citations. The workflow was informed by Pearl's distinction between association, intervention, and counterfactual reasoning, with human judgement retained for research direction, interpretation, and final claims. Results: The workflow produced three empirical analyses and a documented correction process. The analyses estimated an altitude-to-sea-level pace correction of +0.10 min/km per 1,000 m at matched heart rate, showed why pooled altitude-surface regression was not identifiable within this venue system, documented method-dependence in heart-rate-based intensity classification, characterised within-venue route variation as a 64/36 path-fixed-to-trail-variable split with the Sululta label resolving into two functionally distinct sub-venues, and reframed the cohort's training through a 3x3x3 prescription lattice grounded in Ethiopian coaching practice. 
The adversarial phase identified several hallucinated citations, a terminology error between HC1 and cluster-robust standard errors, and several inconsistencies between prose, figures, and computed results. Verification scripts re-derived nearly all numerical claims from the cleaned lap-level data. Conclusions: The case study shows how researchers can organise AI-assisted empirical work so that candidate discovery, claim construction, independent stress-testing, and final accountability remain separated. The workflow did not remove the need for domain expertise or human judgement. Its value was in making the route from candidate finding to manuscript claim explicit, reproducible, and open to challenge. Trial registration: Not applicable.

16

Weight-Guided Constraints for Body Model and Lead Selection in Pediatric CIED MRI Safety Simulations

Hameed, S.; Henry, K.; Jiang, F.; Bhusal, B.; Dillenbeck, H.; Gakenheimer-Smith, L.; Webster, G.; Golestani Rad, L.

2026-05-30 radiology and imaging 10.64898/2026.05.26.26354162 medRxiv

Top 6%

0.0%


Pediatric patients with cardiac implantable electronic devices (CIEDs) face limited MRI access due to RF-induced heating, and computational modeling is increasingly used to characterize this risk. The validity of these simulations, however, depends on pairing body models with clinically realistic lead configurations, guidance that is currently lacking. We retrospectively analyzed 302 CIED surgeries in 281 pediatric patients to derive weight-based constraints for simulation design. Weight alone discriminated epicardial from endocardial lead implantation with AUC = 0.90, and adding age and height yielded no improvement, supporting weight as a sufficient single-parameter selection metric. The probabilistic crossover between approaches occurred at 44 kg, substantially higher than the 10 to 15 kg threshold commonly cited in the literature, with a broad transition zone of 21 to 66 kg in which both lead types were routinely used. Lead length was likewise weight-constrained: only 25 cm leads were observed in patients below 6 kg, and leads of 45 cm or longer were uncommon below 50 kg. These findings yield a three-tier framework, with epicardial-only configurations below 21 kg, dual configurations within 21 to 66 kg, and weight-thresholded lead lengths throughout, enabling MRI safety simulations to focus on clinically realizable anatomy and device combinations.
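The weight thresholds in the abstract above can be read as a simple lookup for simulation design. The sketch below is one possible encoding, not the authors' tool: the function name and return structure are hypothetical, and the endocardial-only tier above 66 kg is an assumption inferred from the "three-tier" wording rather than stated explicitly.

```python
def lead_options(weight_kg: float) -> dict:
    """Map a body-model weight to lead configurations worth simulating.

    Thresholds (21, 66, 6, 50 kg) are taken from the abstract.
    ASSUMPTION: above 66 kg only endocardial leads are returned; the
    abstract does not state this tier explicitly.
    """
    if weight_kg <= 0:
        raise ValueError("weight must be positive")

    if weight_kg < 21:
        approaches = ["epicardial"]
    elif weight_kg <= 66:
        approaches = ["epicardial", "endocardial"]  # transition zone
    else:
        approaches = ["endocardial"]  # assumed third tier

    return {
        "approaches": approaches,
        # Only 25 cm leads were observed below 6 kg.
        "only_25cm_leads_observed": weight_kg < 6,
        # Leads of 45 cm or longer were uncommon below 50 kg.
        "long_leads_uncommon": weight_kg < 50,
    }


infant = lead_options(5.0)
child = lead_options(30.0)
adolescent = lead_options(70.0)
```

Returning flags rather than hard exclusions for lead length reflects the abstract's wording ("observed", "uncommon"), which describes frequencies rather than strict rules.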

17

Health and Economic Benefits of Air Quality Improvements in France through Net-Zero Transition Scenarios by 2050

Sharma, A.; Gressent, A.; Real, E.; Nguyen, K. N.; Corso, M.; Pascal, M.; Medina, S.; Wagner, V.; Slama, R.; Colette, A.; Jean, K.

2026-05-28 public and global health 10.64898/2026.05.27.26354123 medRxiv

Top 6%

0.0%


Background: Climate mitigation policies can lower air pollutant concentrations and deliver substantial health co-benefits. The French Ecological Transition Agency (ADEME) proposed four contrasting Transitions 2050 net-zero scenarios. We quantified mortality, morbidity, and health-economic co-benefits from projected PM2.5 and NO2 reductions across all four scenarios in continental France. Methods: Emission projections were input to the CHIMERE chemistry-transport model to estimate PM2.5 and NO2 concentrations for 2030 and 2050. Health impacts were assessed using disease-specific cessation-lag assumptions relative to 2019, covering premature mortality, morbidity, DALYs, and economic benefits across nine outcomes (hypertension, lung cancer, ischaemic heart disease, stroke, COPD, type-2 diabetes, acute lower respiratory infections, and asthma in children and adults). Findings: Population exposure is projected to decline by about 40% for PM2.5 and 70% for NO2 by 2050, with health gains remaining substantial and broadly equivalent across all four scenarios and modest differences between sufficiency-oriented and technology-driven pathways. Under delayed-impact assumptions, avoided premature deaths ranged from 21,300 to 22,100 for PM2.5 and 24,500 to 26,200 for NO2. Morbidity and disability-adjusted life year (DALY) reductions, as well as economic savings, showed similarly narrow ranges: total avoided morbidity cases were 84,000-88,000, direct medical cost reductions were €1.0-1.1 billion/year, and intangible cost savings were €41-43 billion and €36-39 billion, respectively. Interpretation: Health co-benefits are substantial, consistent across contrasting scenarios, and increase markedly from 2030 to 2050. Explicitly incorporating these co-benefits into climate policy appraisals may strengthen the case for ambitious mitigation and improve decision-maker acceptability.

18

Establishing a framework for human dose prediction in anti-tuberculosis drug development

Patel, A.; Li, A. T.; Solans, B.; Savic, R.

2026-05-28 infectious diseases 10.64898/2026.05.26.26354063 medRxiv

Top 6%

0.0%


Rationale: Efficacious dose selection for anti-tuberculosis drugs has traditionally relied on achieving plasma exposures above the minimum inhibitory concentration, but this approach has not consistently aligned with clinical outcomes. Objectives: We sought to identify early pharmacokinetic-pharmacodynamic targets most predictive of clinical efficacious dose. Methods: We conducted a back-translational, pharmacokinetic-pharmacodynamic simulation-based analysis of 15 anti-tuberculosis drugs. Using pharmacokinetic data from multiple biological matrices and a range of pharmacodynamic metrics, we established candidate exposure-response targets for attainment. We systematically evaluated the predictive accuracy of each target pair against established clinical doses to formulate a decision-making framework linking key drug properties to the most predictive targets. Measurements and Main Results: Depending on the target used, projected clinical doses varied widely - both within and across compounds - highlighting the importance of target selection for dose projection and go/no-go decisions. In general, targeting cellular lesion-level drug exposures relative to in vivo preclinical potency provided an effective approach for early dose selection. However, for highly penetrating drugs, targeting site-of-action therapeutic exposures in the caseum was more predictive of clinical dose. Based on these findings, we developed a preliminary dose prediction tool that enables drug developers to estimate clinically relevant dose ranges of compounds using in vitro and early in vivo data. Conclusions: This work establishes and validates a simple, evidence-based framework to standardize early translational decision-making on dose selection of anti-tuberculosis candidates in development.

19

From CCTA to Surgical Strategy: An Integrated AI Framework for Patient-Specific Coronary artery bypass grafting Planning

Rezaeitaleshmahalleh, M.; Masoumi, S.; Debalme, E.; Sundt, T. M.; Aranki, S. F.; Shin, B.; Nezami, F. R.

2026-06-01 cardiovascular medicine 10.64898/2026.05.28.26354400 medRxiv

Top 7%

0.0%


Background: Coronary artery bypass grafting (CABG) remains the standard of care for complex multivessel and left main coronary artery disease. However, current preoperative planning remains largely subjective, relying on qualitative interpretation of coronary CT angiography (CCTA), operator-dependent stenosis grading, and fragmented multi-software workflows. Invasive fractional flow reserve (FFR), the reference standard for physiologic lesion assessment, is infrequently acquired preoperatively, leaving distal anastomosis planning without an objective hemodynamic basis. Methods: We developed a fully automated, AI-powered platform that converts routine CCTA into a patient-specific CABG planning workflow through five integrated modules: nnU-Net-based segmentation of coronary lumen and calcification; quantitative morphological and topological characterization generating more than thirty descriptors; automated stenosis detection using a local reference-radius formulation; a nine-point composite scoring framework for distal anastomosis site selection incorporating luminal caliber, landing-zone length, calcification burden, distal perfusion reserve, and bifurcation proximity; and interactive virtual graft construction coupled to a distributed reduced-order solver for pre- and post-bypass FFR estimation. Results: Lumen segmentation achieved a mean Dice similarity coefficient of 0.96 ± 0.01, whereas calcium segmentation achieved 0.73 ± 0.15 on the held-out cohort. Platform-derived FFR demonstrated strong agreement with invasively measured FFR (r=0.96, mean absolute relative difference 1.73 ± 1.42%) across the evaluated lesions, supporting the physiologic validity of the reduced-order hemodynamic solver.
End-to-end analysis from raw CCTA to hemodynamic assessment and virtual graft planning was completed in approximately seven minutes per case on a standard workstation, representing a substantial reduction in processing time compared with conventional multi-tool and CFD-based workflows. Conclusions: The proposed platform demonstrates the feasibility of rapid, reproducible, and physiology-informed CABG planning using routine CCTA. By integrating anatomical characterization, automated target-site analysis, virtual graft construction, and reduced-order hemodynamic assessment into a single workflow, the framework provides objective, quantitative surgical decision support compatible with routine clinical workflows. Keywords: Coronary artery bypass grafting (CABG); Fractional flow reserve (FFR); Coronary CT angiography (CCTA); Surgical planning
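For readers unfamiliar with the Dice similarity coefficient reported for the segmentation results above, here is a minimal sketch of the standard definition, 2|A ∩ B| / (|A| + |B|). Representing masks as sets of voxel coordinates and the function name are illustrative assumptions; this is not the platform's evaluation code.

```python
from typing import Iterable, Tuple

Voxel = Tuple[int, int, int]


def dice_coefficient(predicted: Iterable[Voxel], reference: Iterable[Voxel]) -> float:
    """Dice similarity between two binary masks given as voxel coordinates."""
    pred_set, ref_set = set(predicted), set(reference)
    if not pred_set and not ref_set:
        # Convention (assumption): two empty masks count as perfect agreement.
        return 1.0
    overlap = len(pred_set & ref_set)
    return 2.0 * overlap / (len(pred_set) + len(ref_set))


# Toy example: three voxels each, two shared, so Dice = 2*2 / (3+3).
predicted_mask = [(0, 0, 0), (0, 0, 1), (0, 1, 1)]
reference_mask = [(0, 0, 1), (0, 1, 1), (1, 1, 1)]
toy_dice = dice_coefficient(predicted_mask, reference_mask)
```

Because the score is normalized by total mask size, small structures such as calcifications are penalized more heavily per misclassified voxel than large ones such as the lumen, which may partly explain the gap between 0.96 and 0.73.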

20

Normative Speech Modeling for ALS Diagnosis with Application to Other Neurodegenerative Diseases

Shah, M.

2026-05-27 neurology 10.64898/2026.05.25.26354057 medRxiv

Top 7%

0.0%


Amyotrophic lateral sclerosis (ALS) is a progressive neurodegenerative disease affecting more than 450,000 individuals worldwide and is frequently diagnosed more than 12 months after symptom onset, delaying intervention during a critical early window. Because up to 80% of patients develop dysarthria within two years, subtle changes in speech provide a signal of early bulbar motor neuron degeneration. However, existing speech-based systems rely on supervised classification trained on limited datasets, achieving moderate sensitivity and depending heavily on labeled disease examples, which restrict scalability and early detection. This study introduces SPEAK-NORM, the first-ever normative speech modeling framework for early ALS diagnosis, which learns age- and sex-conditioned motor-speech distributions exclusively from healthy individuals. A conditional variational autoencoder models coordination of hypoglossal, laryngeal, and respiratory motor pathways, and deviation from this healthy manifold is quantified through latent representations and reconstruction error to form a 354-dimensional profile. A calibrated linear Support Vector Machine performs subject-level classification under subject-disjoint validation. On the VOC-ALS database (n = 153), SPEAK-NORM achieves 98% accuracy with balanced sensitivity and specificity, significantly outperforming established clinical acoustic indices and prior systems. The framework maintains strong performance under cross-task generalization and when retrained on healthy controls in independent dementia and Parkinson disease cohorts, demonstrating disease-specific deviation patterns rather than generic neurodegenerative change. Spectral, temporal, and latent separations further support interpretability. 
By modeling healthy speech instead of memorizing disease examples, SPEAK-NORM enables scalable early neuromotor screening using recording devices, with potential to support earlier diagnosis, differential classification, and monitoring of ALS progression.